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INTEGRASE TETRAMERS 



HELD OF THE INVENTION 

Present invention includes a mammalian cellular protein that associates with 
integrate (integrase interacting protein), and antibody or antisense inhibition of said 
integrase interacting protein. The novel integrase interaction protein is a target for HIV 
replication prevention or inhibition. 



SUMMARY OF THE INVENTION 

In a study of HIV-1 integrase (IN) completes derived from nuclei of human cells 
stably expressing the viral protein from a synthetic gene it was demonstrated that in the 
nuclear extracts IN exists as part of a large distinct complex with apparent Stokes radius 
of 61 A, which dissociates upon dilution yielding a core molecule of 41 A- The IN 
complexes were isolated from cells expressing FLAO-tagged IN. By present invention it 
was demonstrated that the 41 A core is a tetramer of IN, whereas 61 A molecules are 
composed of IN tetramers associated with a cellular protein with an apparent molecular 
weight of 76 kDa. This integrase interacting protein (Inip76) was found to be identical to 
LEDGF/DFS70/p75, a protein implicated in regulation of gene expression and cellular 
stress-response. HIV-1 IN and Inip76 co-localized in the nuclei of human cells stably 
expressing IN. Furthermore, it has been demonstrated by present invention that 
recombinant Inip76 strongly promoted strand-transfer activity of HIV-1 IN in vitro. Our 
findings reveal that the minimal IN molecule in human cells is a tetramer and and clearly 
demonstrates that Inip76 is likely to play a role in retroviral integration. 
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BACKGROUND OF THE INVENTION 

5 Establishment of the provirus, a DNA copy of the viral genome integrated into the 

host cell chromosome is an obligatory step in retroviral replication. Stable integration into 
me human genome is one of the primary reasons for the persistence of the human 
immunodeficiency virus (HIV) infection, which leads to the acquired immunodeficiency 
syndrome (AIDS). Therefore, HIV IN, the enzyme orchestrating the insertion of the DNA 
10 replica of the viral genome into the cellular chromosomal DNA is a potential target for 
anruTrtroviral therapy. The recent discovery of the new lead compounds able to protect 
cells against HIV infection by inhibiting integration raises hope that integrase inhibitors 
will be added to future combination cocktails (for reviews see Debyser et aL, 2002; Miller 
and Hazuda, 2001; Fani and Marongiu, 2000; Pommier et aL, 2000)). 

15 

Mechanistically and structurally, retroviral integrases (reviewed in Asante- Appiah 
and Skalka, 1997; Brown et aL, 1987; Craigie, 2001; ffindmarsh and Leis, 1999) are 
similar to the well-studied prokaryoticMu and Tn5 transposases and belong to a femily of 
DNA strand transferases that catalyze DNA strand cleavage/Ugation via direct frans- 

20 esterification. In the course of viral integration, HTV IN performs two enzymatic 
reactions: first is the removal of the 3'- GT dinucleotides from the long terminal repeats 
(LTRs) 0-e. the 3 '-processing reaction). The second reaction is the insertion of the 
recessed 3'- viral DNA ends into the opposite strands of the target DNA wherein the 3' 
hydroxyls of the processed LTR ends attack two phosphodiester bonds in the target DNA 

25 molecule (the strand-transfer reaction). Insertion of the two viral LTRs takes place in a 
coordinated fashion across the major groove of the target DNA As a result, the integrated 
provirus is flanked by two 5-nucleotide gaps as well as two mismatched 5>-AC 
dinucleotides, which are probably repaired by cellular enzymes (Voder and Bushman, 
2000). HTV IN, like other retroviral integrases, is comprised of three structural domains. 

30 The amino terminal domain contains the HHCC zinc finger motif, coordinates Zn and 
appears to be important for formation of the active IN multimers. The central, catalytic or 
"core" domain (residues 50-212) contains the active site (i.e. the DDE motif composed of 
D64, D116 and E1S2) and is structurally very similar to the catalytic domains of Mu 



10050608 ZFoet-82 10:39 



phage and Tn5 transposases. In the context of the full-length enzyme, the core domain is 
thought to make sequence-specific contacts with the DNA. The carboxy terminal 
fragment possesses unspecific DNA-binding activity and is critical for multimsrizafcion of 
the enzyme. All single domain fragments of IN are dimeric when produced separately; the 

5 full-length HIV IN as well as other retroviral integrates form multimers under the 
conditions of the in vitro enzymatic assays (Ellison et aL, 1995; Engelman et aL, 1993). 
Recombinant HIV and avian sarcoma virus (AS V) integrases have been shown to behave 
as mixtures of monomers, dimers and tetramers (Coleman et aL, 1999; Deprez et aL, 
2000; Jenkins et aL, 1996). Hie presence of octamers and larger complexes has been 

10 suggested by some studies (Lee et aL, 1997; Leh et aL, 2000), Virion-associated HIV IN 
was shown to be in an oligomeric form, whereby dimers and higher order multimers 
appeared to be stabilized by disulfides, although the complexes were not studied under 
native conditions (Petit et aL, 1999). Earlier, IN derived from avian myoblastosis virus 
(AMV) particles was demonstrated to be muMmeric, behacving at least as a dimer 

15 (GrandgenettetaL,l978). 

The 3 '-processing and strand-transfer reactions can be reproduced in vitro using 
recombinant enzyme preparations and DNA substrates that mimic LTR ends. In vivo 
retroviral integration is preceded by the assembly of a stable and compact prdntegration 
complex (PIC), that contains a 9.5 kb linear DNA copy of the viral genome associated 

20 with viral and cellular proteins (Brown et aL, 1987; Bukiinsky et aL, 1993; Ellison et aL, 
1990; Farnet and Haseltine, 1991; Miller et al., 1997). Several cellular proteins have been 
suggested to play auxiliary roles during retroviral integration. Thus, the barrier-to- 
autointegration factor (BAF) has been reported to protect Moloney murine leukemia virus 
(MoMLV) PICs against suicidal self-integration (Lee and Craigie, 199S). Another cellular 

25 protein, HMCH(Y) was found in HIV PICs and appeared to be essential for their 
integration activity in vitro (Farnet and Bushman, 1997; Miller et aL, 1997). Conversely, 
BAF could substitute for HMG-I(Y) at least in vitro, partially restoring integration 
activity of salt-denatured EffV-l PICs (Chen and Engelman, 1998). Yet, BAF remains to 
be shown to co-fractionate with retroviral PICs. Both BAF and HMG-I(Y) are small 

30 DNA-binding proteins able to bridge and deform DNA molecules (Reeves and 
Beckerbauer, 2001; Zheng et aL, 2000). They are thought to play structural roles within 
retroviral PICs, possibly juxtaposing both LTRs. Similarly, Mu phage transposase as well 
as enzymatf caUy<mrelated X phage integrase require the DNA-bending host proteins IHF 
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aad/or HU to form committed synaptic complexes (Friedman, 1992; Xvfizimchi, 1992). 
IHFand HU appear to substitute for each other in promoting formation of the active X 
integrase-DNA complexes in vitro (Segall et aL, 1994). Another potential co-factor for 
HIV integration, the integrase interactor 1 (mil or hSNFS/BAF47), was originally 
5 discovered in a yeast two-hybrid screen for human proteins interacting with HTV-1 IN 
(Kalpana et aL, 1994). Inil is a subunit of the 2MDa SWI/SNF chromatm-remodeling 
complex (Wang et aL, 1996). It has been proposed that Ml plays a role during retroviral 
replication by directing the PICs to open chromatin regions (Kalpana et aL, 1994) or by 
modulating expression of the integrated provirus (Tureffi et aL, 2001). Recent studies 
10 demonstrated that GFP-tagged Inil was exported from the nuclei of infected cells and co- 
localized with incoming sub-viral particles (Turelli et aL, 2001). Inil has also been 
reported to enhance the release of infectious HTV particles; detectable amounts of Inil 
have been shown encapsulated within virions (Yung et aL, 2001). 

Using a synthetic gene, we have been able to achieve efficient expression of 
15 HTV-1 IN in human cells (Cherepanov et aL, 2000). By present invention we have now 
characterized HIV-1 IN protein complexes present in nuclear extracts from cells stably 
expressing this viral protein. For the time we reported a HTV mtegrase-irteracting protehj 
that forms a distinct complex with IN in human cells and provided an insight into the 
oligomeric state of intracellular HTV IN, suggesting that the minimal cellular IN complex 
20 is a tetramer. It is an object of present invention to prevent or inhibit the integration of the 
HTV* genome into the host cell chromosomes by inhibiting the interaction between the 
integrase interacting protein and integrase. 
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ILLUSTRATIVE EMBODIMENTS OF THE INVENTION 



Examples 
5 Methods 

Example 1 Recombinant DNA 

The HTV-1 mtegrase expression constructs were based on the episomal pCEP4 vector 
(Invitrogen, Groningen, The Netherlands). The plasmid pCEP-EtSPala is identical to the 

1 0 published pCEP-IN* plasmid (Cherepanov et aL 1 2000) with the difference that the Gly 
codon in the second position of the synthetic open reading frame CORF) was mutated to 
Ala. As a result, the construct expressed native HTV-1 IN with an addition of Met-Ala 
dipeptide at the N-terminus. To create the FLAG epitope-tagged IN expression construct 
pCEP-lN*alaFLAG f the IN synthetic ORF from pCEP-IN*ala was amplified in two 

15 consecutive steps with the sense primer 5'-GGCTAGATATCACTAGCAAC 

CTCAAACAG phis two anti-sense primers 5'-GTCGTCCTTGTAATCGCCGTCC 
TCATCTTGACGAGAG and 5 ' -GGCGCTCGAGTT ACTTGTC ATCGTCGT CCTTGT 
AATCGC, the resulting PCR fragment was digested with Xhol and cloned between the 
EcoRI (blunt) and Xhol sites of the pCEP4 vector. This plasmid expressed HTV-1 IN 

20 carrying the C-terminal FLAG epitope (pYKDDDDK). The plasmid pKP1012, for 
bacterial expression of Hiss-tagged HTV-1 IN, was a gift of Dr. R. Plasterk (The 
Netherlands). To obtain pCP6H75, the plasmid used for bacterial expression of Hiss- 
tagged Inip76/LEDGF, the PCR fragment amplified from a HeLa cDN A with the primers 
S'-GGCCGGATCCGACTCGCGATTTCAAACCTGGAGAC and S'-CCGCGAATTCT 

25 AGTTATCTAGTGTAGAATCCTTC was digested with BamHI and EcoRI and cloned 
between the BamHI and EcoRI sites of pRSETB (Invitrogen). To prepare the mini-HIV 
IN substrate, the plasmid pU3U5 (Cherepanov et al., 1999) was digested with Seal 

Example 2: Cell Culture 
30 The human embryonic kidney cells expressing SV40 large T antigen, 293T were obtained 
from Dr. O. Danos (Evry, France). Cells were grown in Dulbecco's modified Eagle's 
medium (DMEM) (Invitrogen) supplemented with 10% fetal calf serum (FCS), 2 mM 
ghitamine and 20 ug/ml gentamicin at 37*C in 5% CO2 humidified atmosphere. To 
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establish stable cell lines, 293T cells were transfected by electroporarion with integrate 
expression constructs and selected with 200 ug/ml of hygromycin B (favhrogen). The 
selected cell lines, 293T-IN , ala and 293T-Itf alaFLAG, expressed IN at the levels similar 
to the previously reported 293T-IN* cells (Cherepanov et al., 2000) as determined by 
western blotting and immunofluorescence. IN expression was stable for over 40 passages. 
For radioactive immunoprecipitation experiments cells were labeled in 
methionine/ cysteine-firee DMEM (Invitrogea) supplemented with 10% dialyzed PCS phis 
0.1 mCi/ml of TRAN 35 S -LABEL (ICN Biomedicals, Asse-Rclegem, Belgium) for 
24 hours. 



Example 3: Preparation of Nuclear Sxtrartx 

293T-IN s ak or 293T-HN*alaFLAG cells grown to a confluency of 80-90% were 
dissociated from plastic dishes using trypsin, washed with PBS and re-suspended in the 
modified cytoskeleton buffer (10 mM Pipes [pH 6.8], 10% (w/v) sucrose, 1 mM DTT, 

15 1 mM MgCfe plus the EDTA-free protease inhibitor cocktail (Roche, Brussels, Belgium)) 
(Fujita et al, 1997) containing 100 mM NaCl (referred to as 100CSK buffer). Cells were 
lysed for 10 min on ice with 0.5% NP40; nuclei were pelleted and washed with 100CSK. 
To extract IN, nuclei were re-suspended in the 400CSK buffer (same as 100CSK, but 
combining 400 mM NaCI) and left on ice for 5 min, the chromatin was removed by 

20 centrifogation at 7,500 rpm for 2 min. The total protein content of the nuclear extracts 
was measured using the BCA protein assay (Pierce, Rockford, IL), with bovine serum 
albumin (BSA) as the standard. 



Example 4 : Chemical C ross-linldng 

The nuclear extracts were diluted using 400CSK buffer to adjust the total protein 
concentration to 100, 20 or 4 ug/ml. DTSSP (Pierce, Erembodegem-Aalst, Belgium) was 
dissolved in water immediately prior to the experiment and used at 2, 0.5 or 0.1 mM. 
Cross-linking reactions were allowed to proceed for 15 rain at room temperature, were 
terminated by addition of Y* volume of the 4 x SDS sample buffer (200 mM Tris HC1 [pH 
6.8], 4% SDS and 40% (v/v) glycerol) and incubation at room temperature for 20 min. 
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Krampb* 5: Gel Filtration Chromatography 

Nuclear extracts and immunopurified INf were fractionated on a Superdex 200 HR 10/30 
gel filtration cohunn (Amereham-Phannacia). The 400CSK buffer was used in all 
chromatography experiments. The column 'was operated at 0.6 ml/rain, 4"C, and 
S calibrated using low and high MW gel filtration standards from Amersham-Pharmacia 
(blue dextran; thyroglobulin, Rs/Mw 85 A/669 fcDa; femtin, 61 A/440 kDa; catalase, 
52.2 A/232 kDa; aldolase, 48.1 A/158 kDa; BSA, 35.5 A/67 Jd>a; diyixKrtiypsUiogen A, 
20.9 A/25 kDa). The sample volume was kept at 200 jil; fractions of 300 jjJ were 
collected and analyzed by western blotting using polyclonal anti-IN antibodies. When 
10 necessary, gel filtration fractions were concentrated by precipitation with trichloroacetic 
acid. Stokes radii (B*) and approximate MWs of the IN complexes were determined from 
their experimental partition coefficients (Kay) as described (Siegel and Monty, 1965). 



Example 6: Western Blottin? and Immunoprecipitation 

15 The gradient 4-12 and 4-20% Novex Tris-Glycine gels were purchased from InvHrogen. 
Proteins were transferred onto PVDF membrane (Bio-Rad, Nazareth, Belgium); detection 
was done with ECLH- (Amersham-Phannacia, Roosendaal, The Netherlands). The rabbit 
polyclonal anti-HTV-l BST were home made and used at a 0nal dilution of 1:30.000 
(Cherepanov et al, 2000). The anti-FLAG M2 monoclonal antibody was from Sigma- 

20 Aldrich, the monoclonal anti-DNA-PKcs Ab-4 cocktail from NeoMaikere (Fremont, CA) 
and the monoclonal anti-LEDGF p75/p52 from BD Biosciences (Erembodegem, 
Belgium). The affinity purified anti-hMCM3 polyclonal antibody was a gift from Dr. 
Knippers R. (Universty of Konstanz, Germany). A combination of prestained MW 
markers (New England Biolabs, Hertfordshire, UK) and Markl2 (Invitrogen) was used to 

25 estimate MWs of the cross-linking products and Inip76. DNAPKcs detected in a 293T 
nuclear lysate sample using the anti-DNAPKcs Ab4 antibody served as the 470 KDa 
marker in some western blots. 

In the initial immunopreripitation experiments, 30 pi of protein G agarose 
(Roche) and 1-3 ng of the anti-FLAG M2 antibody was added to the nuclear extracts 

30 prepared in 400CSK and diluted to obtain total protein concentration of 200 yg/ral; the 
suspension was strirred at 4 P C overnight (12-18 hours). The agarose beads were washed 
once with 400CSK and 4 times with 100CSK plus 0.1% NP40. Protein was eluted in 
400CSK buffer by addition of 200 pg/ml FLAG peptide (Sigma-Aldrich) or in 
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SDS PAGE sample buffer. To purifiy LEDGF-INf complies, immiraoprecipitation was 
carried out using undiluted nuclear extracts (700 ug/rol total protein) for 3-5 hours. To 
identify the Mp76 protein by N-terminal sequencing and mass spectrometry, the 
procedure was unsealed. 293T-IN* alaFLAG cells were grown to confluency on five 
500 cm 2 dishes (VWR International, Leuven, Belgium), lysed with 0.5% NP40. IN 
complexes were extracted from the nuclear pellets into 13 ml of the 400CSK buffer and 
incubated with 300 ul of protein G agarose beads and 40 ug of the anti-FLAG M2 
antibody for 4.5 hours. The INr complexes were eiuted in 700 ul of the 400CSK buffer 
•with 200 ug/ml FLAG peptide. 



Example 7 : Amino Terminal Xecmenciny ap dA4ass Spvctmnrny 
Imnmnopurified INrInip76 complexes were precipitated with trichloroacetic acid and 
redissolved in the SDS PAGE sample buffer. Approximately 3 ng of the Inip76 protein, 
electroblotted onto a Sequi-Blot PVDF membrane (Bio-Pad) from an SDS PACT gel, 
15 was subjected to Edman degradation on a pulsed liquid phase Procise 491 cLC protein 
sequencer {Applied Biosystems, Lennik, Belgium). For mass spectrometry analysis the 
coomassie blue-stained band of Inip76 was cut from an SDS PAGE gel, destained in a 
200 mM ammonium bicarbonate/50% acetonitrile, air-dried and soaked in 8 jd trypsin 
solution (16 ng trypsin (Promega) in 50 mM ammonium bicarbonate) on ice for 20 nrin. 
Following overnight digestion at 37°C, the supernatant was recovered, and the gel slice 
was extracted twice using 60% acetonitiile/0.1% formic acid. The extracts and the 
supernatant were pooled and dried in a Speedvac concentrator. The peptides were re- 
dissolved in 0.1% formic acid and analysed by on-line nanoflow high performance liquid 
chromatography tandem mass spectrometry (LOMS/MS) on an UWMate capillary LC 
system (LC-Packings, Amsterdam, The Netherlands) coupled to a Q-Tof mass 
spectrometer (Micromass, Manchester, UK) equipped with an electrospray ionization 
source. Technical details of this system are reported elsewhere (Devreese et al. ( J. 
Chromatography A in press). All spectra were processed using the MassLynx and 
MaxEnt software delivered with the mass spectrometer. 



Sample g: Indirect Tmrnunaflttare^ynce Miarns^npy, 

293T-IN s alaFLAG cells grown in Lab-TekH glass chamber slides (VWR Ihternational) 
were fixed with 4% formaldehyde in phosphate buffered saline (PBS) for lOmin and 
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perraeabilized in ice-cold methanol. The cells were further blocked m PBS supplemented 
with 20 mM aminonium chloride and 10% FCS and incubated with rabbit polyclonal anti- 
FLAG antibodies (diluted 1:10.000 in PBS/10% FCS) (Sigma-Aldrich) and monoclonal 
airti-LEDGF (1:300) or anti-DNA EKcs (1:300) followed by Alexa-555 anti-rabbit and 
Alexa-488 conjugated anti-mouse IgG antibodies (Molecular Probes, Leiden, Hie 
Netherlands). The nuclear DNA was labeled with 5 pM To-Pro3 iodide (Molecular 
Probes). Confbcal laser scanning fluorescent microscopy and imaging was carried with an 
LSM510 system (Carl Zeiss, Zaventem, Belgium) using a 488 nm Ar ion laser with a 
505-530 nm band pass filter for Alexa-488, a 543 nm HeNe laser with a 565-615 nm filter 
for Alexa-555 and a 633 nm HeNe laser with a tow pass 650 nm filter for ToPjo-3. All 
acquisitions were done in the multi-track mode, 

Example 9: Recombinant proteins 

The His$-tagged HTV-1 IN was produced from the pRP1012 plasmid in the Endo I-ftee 
host Escherichia coli strain PCI (BL21(DE3), AenddiiTcP, pLysS) (Cherepanov et aL, 
1999). The protein was purified from the soluble fraction, by chromatography on Ni-NTA 
agaroBe (Qiagen, Hilden, Germany) and Heparin Sepharose (Amersham Pharmacia, 
Uppsala, Sweden) in the presence of 7,5 mM 3-[(3- 
cholaraddopropyl)dimethylanmonio]propanesulfo (CHAPS) (Sigma-Aldrich), The 
His^-tag was removed by incubation of the purified protein with thrombin (Novagen). 
The Bfiss-tagged Inip76 was expressed from the plasmid pCP6H75 in PCI by induction 
with 1 mM isopropylthiogalactopyranoside (EPTG) at 29 9 C for 3 hours* in LB medium. 
The soluble protein was then purified by batch adsorption to Ni-NTA agarose and 
chromatography on a 1 ml HiTrap Heparin Sepharose column (Amersham Phamacia). 
The protein was eluted from the Heparin Sepharose column using a linear NaCl gradient 
in 30 mM TrisHCl [pH 7.0]. Peak fractions collected at approximately 800 mM NaCl 
were pooled and concentrated using Centricon-30 (Millipore, Brussels, Belgium) to 
obtain the final protein concentration of 1 mg/ml. The protein was estimated to be at least 
95% pure by SDS PAGE. 
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Eca npte 10 Disruption of the nuclear and chromosnmnl l o calization of HIV IN using 
smaff interfering RNA molecules specific, t o the Iru p76 mRNJ 

RNA interference with synthetic short KNA duplexes has been used to knock-down 
5 expression of endogenous genes in mammalian cells (Elbashir et al., 2002). We used 
short interfering RNA (siRNA) duplexes to study the effect of depletion of Inip76 in 
293T-INsalaFLAG cells on the distribution ofHIV-1 IN 

i): siRNA Preparatinp 

10 All oligonucleotides for siRNA preparation were chemically synthetized by 

Dharmacon (CO, USA) and Xeragon (MD, USA) and were composed of ribonucleotides 
(A,U,G and C) plus a pair of 3*-temiinal desoxy-thyroidines (dTdT). 

Small interfering RNA molecules (siRNA) were prepared by annealing the 
following pairs of oligonucleotides: 

15 i) CAGCCCUGFUCCUUCAGAGA-dTdT plus UCUCUGAAGGACAGGGCUG-dTdT, 
to obtain 76 A RNA; 

i0 AGACAGCAUGAGGAAGCGA-dTdT plus UCGCUUCCUCAUGCUGUCU-dTdT, 
to obtain 76B RNA; 

iiO CAGAUGCAUUGAGGCCUUG-dTdT phis CAAGGCCUCAAUGCAUCUG-dTdT, 
20 to obtain 76C SNA 

iv) GCGCGCUUUGUAGGAUUCG-dTdT phis CGAAUCCUACAAAGCGCGC-dTdT, 

to obtain NC RNA 

The 76A, 7SB and 76C siRNA molecules are thus designed to be specific for the lhip76 
mRNA and contain an anti-sense strand that can hybridize to the target mRNA The NC 
25 molecule is an RNA duplex which is not specific to Inip76 RNA norto any other known 
human mRNA 



fL Transfectian of the 2Q5T- rtf cdaFLAG Cells with the siRNA Molecules 

293T-IN*alaFLAG cells expressing FLAGepitope tagged HEV-1 IN were seeded 
into 8 weU Nunc LabTekH chamber glass slides (VWR International) and were 
transfected at a confluency of about 25% with each siRNA using GeneSilencer, a 
liposomal transection reagent purchased ftom GTS (Gene Therapy Systems, CA, USA). 
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Transfection was done according to the manufacturer's recommendations. Briefly, for 
transfection of each well, 0.3 fig of RNA and 1.75 pi of the GeneSilencer reagent were 
nsed to transfect each well of the slide. 

Expression and distribution of IN and Imp76 was assessed by indirect 
immunofluorescence. Two days post transfection the cells were fixed with 4% 
formaldehyde in PBS for 5 minutes, followed by four washes in PBS and treated with ice- 
cold me thano l for 5 minutes. Fixed and permeabilized cells were then incubated in 
blocking solution (10 % fetal calf serum/1 0 mM ammonium chloride in PBS) for 30 
minutes at room temperature. The cells were incubated -with mouse monoclonal anti- 
human LEDGF antibody (purchased from BD Bioscience, KY, USA) diluted 1:200 in 
blocking buffer or monoclonal anti-FLAG M2 (Sigma-Aldrich) diluted 1:200 for 1 hour. 
Non-bound primary antibodies were washed in three changes of blocking buffer. Cells 
were stained for 1 hour with Alexa-488 conjugated goat anti-mouse antibody (Molecular 
Probes) diluted 1 :300 in blocking buffer and washed with blocking buffer. Chromosomal 
DNA was stained by addition of 5 jiM ToPro3 iodide (Molecular Probes) to the 
secondary antibody solution. 

Western blot analysis demonstrated that the expression levels of HIV- 1 IN in the 
cells transfected with 76A siKNA were reduced dramatically (not shown), suggesting that 
Inip76 is necessary for stability of the IN protein in human cells, likely protecting IN 
from proteolysis. 

Example H Recombinant Inn?76-IN Complex Produced i n Escherichia colt 

As elaborated above, recombinant lnip76 is a potent stimulator of HIV IN activity 
tn vitro. We wanted to reconstruct the Inip76-IN complex from recombinant proteins, as 
such complex can be potentially used to study interactions of the proteins on molecular 
level as well as an enzyme for DNA recombination in vitro and m vivo. Biochemical 
studies on HIV EST as well as other retroviral INs suffer from poor solubility of the 
proteins. We found that adding purified recombinant Inip76 to IN solutions had a 
solubilising effect on the EST protein. Thus, when both proteins were mixed at a 
concentration of 0.25 mg/ml in 150 mM NaCl/10 mM Hepes/1 mM MgCkAU % NP40. 
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Approximately 50 % of the IN protein remained soluble after 45 minutes incubation at 
4 °C and centrifugation at 20000 g for 5 minutes. 3n contrast, only about 5 % of IN was 
recovered in soluble form when Inip76 was omitted fiom the mixture. This result 
indicates that the complex between IN and Iiup76 is more soluble than free recombinant 
5 IN, and that such complex can be formed using recombinant proteins. A soluble form of 
retroviral integrase in complex with its chromosomal receptor might help to determine the 
structure of the active form of the retroviral IN. In addition, if active such complex could 
be used as an enzyme for in vitro DNA recombination. 

10 *■ Construction of the Mp76 Expression Plasmid pCP-Nat75 

The open reading frame of Inip76/LEDGF was PCR amplified from pCP6H7S using 
flw primers o*-TGACTCGCGATTTCAAACC and 5 '-CCGCGAATTCT AGTTATC 
TAGTGTAGAATCCTTC. The resulting DNA fragment was digested whh EcoBI and 
subcloned between Ndel (treated with T4 DNA polymerase to obtain blunt terminus) and 
Eam sites of the pRSBTB vector (favhrogen). The complete Mp76 open reading frame 
and the phage T7 promoter region of the resulting plasmid, pCP-NatTS, was sequenced to 
confirm that no mutation occurred. The plasmid was transformed into the E, noli PCI 
strain (E. coli BL21(DB3), J*Ktt::Tc*, P I,ysS) (Cherepanov et aL, 1999) by standard 
methods. The resulting strain, PC2LEDGF expressed Inip76 upon induction with JDPTG. 
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Construction of the IN Expression Plasmids pKB-INSH and pKB-A15IN<5H 



To construct the plasmid for bacterial expression of HIV-1 IN whh C-terminal Hiss 
tag, we amplified the foil IN open reading frame using PCR whh the primers 5'-GCGCG 
TCGACATCCTCATCCTGTCTAC and 5 AATACGACTC ACTATAGGG from the 
pINSD plasmid (obtained from The NTH AIDS Research and Reference Reagent 
Program, catalog #2820). The RCR fragment was digested with ASfel and SaU and 
subcloned between ittfel and Sail sites of pET-20b(+) (Novagen). The open reading frame 
of IN as well as the T7 promoter region of the resulting plasmid pKB-IN<5H was 
sequenced to verify absence of mutations. When the plasmid was transformed into PCI 
bacteria, it expressed HIV-1 C-terminally Hiss-tagged IN. However, pKB-IN©H is not 
able to stably co-exist with P CP-Nat75, as both plasmids have the same type of 
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replication origin. To obtain pKB-Al6IN6H, an IN-expression plaSmid compatible with 
pCP-Nat7S, we inserted the BglU/DraJO. fragment of pKB-IN6H between BamHl and 
Bsdl rates of pACYC177 vector (Chang and Cohen, 1978) (available from New England 
Biolabs) (the DnSSL and Bsal ends of the DNA fragments were treated with T4 DNA 
5 polymerase to obtain Ugatable blunt tennini). 

iii Co-expression ofInip76 and Hisg-tagged HTV-1 IN vl Escherichia coli 

The plasmid pKB-Al5IN6H was transformed into the PC2LEDGF E- coli strain 
10 (BL21(DE3), AendA:-^, pLysS, pCP-Nat75) by standard methods- After 
transformation, bacteria harboring the newly transformed plasmids were selected on LB 
agar plates containing 120 ug/ml ampicillin, 40 ug/ml kamunicin and 15 Mg/ml 
tetracycline at 28 °C. This prooedure resulted in introduction of the pKB-A15IN6H 
plasmid and knock-out of the incompatible pLysS plasmid. The resulting strain 
1 5 PC2LEDGF-C (BL21(DE3) > MndA ::Tc* pCP-Nat75, pKB-A151N6H) was grown on LB 
agar plates and in liquid LB medium supplemented with 120 ug/ml ampicillin, 40 ug/ml 
kanamicin and 15 ug/ml tetracycline at 28 °C. Expression of Inip76 and IN was induced 
by addition of 1 mM IPTG at 29 °C in LB medium. 



20 



iv Purification of the Inip76-IN Complex Expressed in£. coU 



An overnight culture of PC2LEDGF-C (100 ml) was diluted with 4 1 of fresh LB 
25 medium supplemented with 120 ug/ml ampicillin and 40 ug/ml kanamicin and grown in a 
shaking incubator at 28 °C until its optical density (measured at 600 nm) reached 
approximately 0.8, then the culture was induced with 1 mM IPTG. After 3 hours 
expression at 29 °C the bacteria were collected by centrifligation and kept frozen 
overnight at -70 °C. Next day defirozen bacteria were re-suspended in 20 ml of cell 
30 breaking buffer (500 mM NaO/30 mM Hepes/1 mM MgCfe/O.l mM PMSF) and lysed by 
two passages through a French press at a cell pressure of 18000 psi. The resulting 
bacterial lyaate was cleared by centrifugation at 15000 g for 30 nun supplemented with 25 
mM imidazol and mixed with 1.5 ml of Ni-NT A resin (Qiagen) equilibrated with the 
same buffer. The slurry was gently mixed for 30 minutes to allow His-tagged protein to 
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bind to the resin. The resin was then packed into a disposable gravity flow column and 
washed with 50ml of 400 mM NaCl/30mM Hepes/lmM MgCfe/25 mM imidazol, 
pH7.2 followed by 12 ml of 400 mM NaCI/30 mMHepes/1 mM MgCfe/35raM 
imidazol, pH 7.2. The bound protein was then eluted in the elation buffer (400 mM 
5 NaCi/30 mM Hepes/1 mM MgCl^OO mM imidazol, pH 7.2). Both IN and Mp76 doted 
in the presence of 200 mM imidazol, as can be seen from the SDS-PAGE gel shown in 
FIG. 9 A In a control experiment, carried-out with the lysate of induced PC2LEDGF, 
Inip76 was detected only in the non-bound fraction, and no Inip76 was detected in the 
fiactions eluted with 200 mM imidazol (FIG. 9B). Therefore, as was expected, in the 
10 absence of Hiss-tagged IN, Inip76 does not bind to Ni-NTA confirming that all Ittip76 
found in the samples is indeed associated with IN. 

The fractions containing IN and Inip76 were pooled together and injected into a 
lml HiTrap SP Sepharose column (Amersham-Pharmacia) equilibrated with 35% 
chromatography buffer B in A (buffer A was 1 mM MgClz/30 mM Hepes, pH 7.2, buffer 
15 B was 1 M NaO/1 mM MgCfe/30 mM Hepes, pH 7.2). The column was washed with 
35 % B at 0.7 ml/min and the bound proteins were eluted in a 1 5 ml linear gradient from 
35 % B to 90 % B. The host-derived contaminating proteins, including the 70 kDa band, 
as well as free IN protein were found in foe flow-through fraction (FIG. 9C). The Inip76- 
IN complex eluted in approximately 55 % B, corresponding to 550 mM NaCl (fiactions 
20 16-22 in FIG. 9C). The fractions containing Inip7$ and IN were pooled and concentrated 
by ultrafiltration using Centricon-30 (Millipore) to a final volume of approximately 70 ul 
(FIG. 10A). The final protein concentration was determined to be 4.8 mg/ml using BCA 
kit (Pierce) and BioRad protein assay (BioRad) with bovine serum albumin as a standard. 
The protein was kept at 4 "C for a week without signs of aggregation. In contrast, free 
HIV IN was not soluble at these concentrations in buffers with physiological pH. 
Detergents such as CHAPS orNP40 were needed to keep the protein from aggregation 
even at protein concentrations below 1 mg/ml. 
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Example 12: HTV-J IN is present in the in s oluble mnhar frnrtjr^ 
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The 293T-IN s ala cell line used in this work was similar to the previously reported 293T- 
DP, except that it expresses HTV-l IN "with the Met-AIa dipeptide at its N-terminus 
(instead of Met-Gly) (Cherepanov et aL, 2000), This change was introduced to prevent 
potential myristoylation of the protein (Boutin* 1997) and affected neither cell line 
5 stability nor IN expression levels. The integrase protein was nuclear in both cell lines as 
determined by indirect immunofluorescence microscopy (data not shown and Cherepanov 
et al., 2000). After centrifugation of the dighonin- or NP40-tteated 293T-IN 5 ala cells, 
most of the IN protein was retained in the nuclear pellet (Figure 1A), Since NP40 
penneabilizes both the plasma membrane and the nuclear envelope, the bulk of IN present 

10 in the cell is thus stably associated with insoluble nuclear structures. IN was extracted 
from the nuclei by increased salt concentrations; 350-500 mM NaCl was sufficient to 
ehite the protein (Figure IB). Therefore, buffers supplemented with 400 mM NaCl were 
used for preparing nuclear salt extracts. 

Some chromatin-associated proteins can be eluted from detergent-penneabilized 

15 nuclei by gentle treatment with DNAse I or micrococci nuclease (Fujita et al., 1997; 
Hohhoff et al., 1998; Meller and Fisher, 1995). We observed no elution of IN when nuclei 
prepared from 293T-IN*aIa cells were treated with DNase I (Figure 1C), that completely 
digested nuclear DNA to fragments of less then 200 bp (data not shown). In accordance 
with Fujita et al. (Fujita et aL, 1997), MCM3, a chromosomal replication fector, was 

20 removed from the nuclei by gentle DNase I treatment (Figure 1C). The same salt 
concentration was required to extract IN from the nuclease-digested nuclei (data not 
shown). However, subjection of the DNase-digested nuclei to low ionic strength 
conditions led to efficient elution of the protein (Figure ID). Non-digested nuclei did not 
release any detectable amount of IN in the hypotonic conditions (Figure ID). These 

25 results indicate that IN is associated both with chromosomal DNA and with some other 
nuclear structure, which is destabilized in low ionic strength conditions. Similar results 
were obtained when 293T-INfala nuclei were exposed to micrococci nuclease (data not 
shown). 

30 Example 13: Chemical Cros*-lmkin% of IN Complexes Present in Nuclear Extracts 

We used 3,3'-DilMobis[suto propionate] (DTSSP), an amme-sperific N- 

hydroxysuccinimid ester with a 12 A spacer arm, to cross-link protein complexes present 
in the nuclear extracts of 293T-IN s ala cells. The nuclear proteins were extracted from the 
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NP40-treated 293T-Dfala nuclei using the cytoskeleton buffer containing 400 mM NaCl 
(400CSK). The total protein concentration was adjusted to 100, 20 and 4 ugftnl; the 
samples were incubated with DTSSP and separated in a non-reducing 4-12% SDS PAGE 
geL IN-contahiing cross-linking adducts were detected by western blotting using 
5 polyclonal anti-IN antibodies. A typical result is shown in Figure 2 A. In the non-cross- 
Unked samples (lanes 2, 6 and 10) as well as in the samples cross-linked in the presence 
of SDS (lane 1) only IN monomer and a band corresponding to IN dimer (approximately 
60 kDa) were apparent. Addition of 0.1-2 mM DTSSP yielded cross-linked complexes of 
60 kDa (060*), 150-180 kDa (the P 150 d band, clearly visible in lanes 7, 8 and 12), 250- 
10 350 kDa (p300*\ lanes 7 and 8) and less resolved higher molecular weight (MW) species. 
Strikingly, detection of the cross-linked IN complexes with our polyclonal anti-IN 
antibodies was far more sensitive than detection of the non-cross-linked IN (compare the 
lanes 2 and 4, for example). Probably, some strong conformational epitopes are better 
preserved within cross-linked IN during SDS PAGE and western blotting. No unspecific 
15 bands were revealed in nuclear extracts from 293T cells before and after cross-linking 
with DTSSP confirming that all bands detected in the western blot correspond to IN- 
conlaimng complexes (data not shown). The cross-linking of the 293T-IN*ala nuclear 
extract was clearly dependent on the concentration of both DTSSP and protein. Saturating 
cross-linking of diluted protein samples followed by denaturing SDS PAGE has been 
20 used to determine MWs of native protein complexes (see (Corey et al, 1998) and 
references therein). Cross-linking of the diluted nuclear extract (4 ug/ml protein) with 
2 mM DTSSP yielded a band of approximately 150 kDa (plSO 6 *, lane 16), while in more 
concentrated extracts (100 ug/ml) a diffuse band of 250-300 kDa was the most prominent 
(P300* 1 , lane 8). Thus, there exist at least two different IN complexes: a large complex at 
25 higher protein concentrations and a smaller complex in the diluted extract. The pl50 d 
product seems to be the result of complete cross-J&nkrag since no significant change in 
cross-linking occurs when the concentration of DTSSP was increased from 0.5 to 2 mM 
(compare lanes 15 and 16) and higher (not shown). Thus, plSCf* probably represents the 
IN complex present in the diluted nuclear extract Moreover, this complex is a 
dissociation product and importantly, a component of the larger complex, since it 
appeared as a partial cross-hnkmg adduct in the reactions with more concentrated protein 
extracts at 0.5 mM DTSSP and it decreased at 2 mM DTSSP (compare lanes 7 and 8). 
However, the p300* band is probably not the result of complete cross-linking of the larger 
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complex, since a strong smear and some less resolved bands are present above p3O0°* on 
the western blot; aggregated material not able to enter 4% acryiamide gel is also evident 
(lanes 8 and 12). Some of die high MW adducts in the reactions with 100 jig/ml extracts 
may result from non-specific faxter-molecular csxoss-iinkiiDg of proteins. Cross-linking of 
5 IN complexes in the nuclear extracts using oxidizing Cu 2+ -[1, 10-phenanthroline]3 
complex (Cys-Cys cross-linker (Ji, 1983)) were also suggestive for the presence of a large 
protein complex that dissociated upon dilution releasing a molecule of approximately 
150 kDa (data not shown). 

10 Example. 14; Apparent Stokes Radii of the two Nuclear Integrate Complexes 

To confirm the presence of both IN complexes and deduce their size we used gel 
filtration. Undiluted (550 yg/ml protein) and diluted (30 (igtal) nuclear salt extracts from 
293T-I>Fala cells were run on a calibrated Superdex 200 column and the IN elation was 
followed by irnmunoblotting the collected fractions (Figure 3A). We observed two 

15 distinct elution volumes corresponding to two different IN complexes. After 
chromatography of the undiluted extract, IN eluted symmetrically with a peak maximum 
in fractions 8 and 9 (elution volume, V e - 11.2 ml; Stokes radius, = 61 A, see Figure 
3B). However, IN eluted later and in a broader peak with a V* of 14.1 ml (1^ = 41 A, 
Figures 3 A and 3B) when the sample was diluted prior to gel filtration. Assuming both 

20 complexes are globular, their MWs can be calculated to be 380 kDa and 115 kDa 
respectively (Figure 3Q, The smaller dilution-resistant molecule (^°4lA) most likely 
corresponds to the pl50°* cross-linked complex observed in the previous experiment, 
while partial cross-linking of the 61 A IN complex probably resulted in p30Q cl . The 
effective dilution of the peak fractions on the column was 5- to 7- fold as calculated from 

25 calibration, runs (data not shown). Thus, the sample concentrations used in our gel 
filtration experiments corresponded to those used in the cross-linking experiments 
(100 tig/ml arid 4 ng/ml). 

Example 15: Purification and Characterization of FLAG-Tazzed IN Complexes 
30 To facilitate isolation of native IN complexes from cell extracts we modified the IN 
expression construct adding the FLAG epitope tag at the carboxy terminus of IN. The 
293 T-Etf B alaFL AG cell line, obtained by stable transfection of 293T cells with the tagged 
expression construct, was very similar to 293T-IN*ala in stability and levels of IN 
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expression (not shown). FLAG-tagged IN (INt) localized predominantly in the nuclei in a 
diffuse pattern and was associated with chromosomes Awing mitosis (see below) as has 
been previously reported for non-tagged HTV-1 IN (Cherepanov et aL, 2000). IN, could 
be extracted from the nuclei of 293T-IN 5 alaFLAG cells in the same conditions as for non- 
5 tagged IN. The cross-linking pattern of INr with DTSSP was very similar to that of non- 
tagged IN (Figure 2B). The two major cross-linking products of IN* showed slightly 
slower migration in SDS PAGE gels than the original pl50 cl and p300* which can be 
attributed to the negative charge of the FLAG tag and increased molecular weight of the 
tagged protein. For convenience, however, we refer to the INr cross-Unking adducts as 
10 pl50 d and p300 d . The gel filtration profiles were as observed for the non-tagged IN 
extracted from 293T-IN*ala cells (see below and data not shown). 

In initial imrnunoprecipitation experiments, we incubated diluted nuclear extracts 
from metabolicaily labeled 293T-IN°alaFLAG cells with the axrti-FLAG M2 antibody and 
protein G agarose overnight. The protein isolated in this way displayed a single specific 
band in SDS PAGE gels migrating at the expected position for the FLAG-tagged IN 
(33.5 kDa) (Figure 4A). Isoelecttofocusing of imrmmoprecipitated Dfc in denaturing pH 
gradients showed a major band close to the predicted pi (6.5), which reacted with anti-IN 
serum in immunoblot (data not shown). When the IN, immunoprecipitated from a nuclear 
extract of 293T~Itf alaFLAG cells was eluted from the anti-FLAG M2 antibody with 
synthetic FLAG peptide and incubated with DTSSP, the plsC cross-linking product was 
readily obtained (Figure 4B). When higher INr concentrations were used in cross-Jinking, 
the immunoreactive reaction products accumulated at the top of the gel, suggesting 
aggregation of the protein (data not shown). We were not able to find reaction conditions 
to reproduce the p300« cross-linking product with INr preparations purified this way. 
Fractionation of purified IN f on a Superdex column showed a peak with a K, v value very 
close to that of the 41 A complex (Figure 4C). The fact that the 41 A complex exists in 
the purified INf preparation and the apparent MW (115-150 kDa), based on cross-linking 
and gel filtration experiments, suggest mat the 41 A molecule is a tetramer of IN. 
Apparently, the native 61 A IN complex was not stable enough to withstand 
imrnunoprecipitation under these conditions. Next, we tried shorter imrnunoprecipitation 
times (3-5 hours) starting from more concentrated nuclear extracts. Although the overall 
yield of INr decreased, the immunoprecipitated samples were found to contain an 
additional protein. It had an apparent MW of approximately 76 kDa, as determined by 
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SDS PAGE (Figure 5 A) and was present at variable ratios to INf in different preps. This 
protein, here referred to as Biip76 (for 76 kDa IN interactor protein), is specifically 
associated with ESfe since, it could not be immunoprectpitated from the parental 293T 
cells with the anti-FLAG antibody (compare lanes 2 and 3 in Figure SA). When undiluted 
S nuclear extract from 293 T-DSTalaELAG cells was imnnmoprecipitated with anti-FLAG 
antibody for 4 hours both INf and Inip76 bands were readily det e ct ed (Figure 5B). 
Although extending hmnunoprecipitation to 16 hours improved EM* recovery* the yields 
of Inip76 were greatly reduced (compare the lanes 1 and 2 in Figure 5B). Iniriguingly, the 
p300 d band, detected after DTSSP cross^inldng of the nuclear salt extracts, was readfly 
10 observed when the Ihip76-containing INf preparations were cross-linked with DTSSP 
(Figure 6E\ suggesting that Inip76 is part of a large IN complex present in the nuclear 
extra c t s . 

Example 16: Identification of the Tmp76P rn^in nx T.Rn(lPmWR7Q/p75 

IS By up scaling immunopredpitadon we were able to isolate sufficient amounts of Inip76 
for characterization by Ed man degradation and mass spectrometry (Figure 6A). The 
amino terminal sequence obtained from Inip76 was XXRDFKPGD (the first two residues 
being not resolved due to background noise). Scanning the TrEMBL protein database for 
human proteins carrying this sequence tag (Wilkins et al., 1996) resulted in four hits with 

20 accession numbers 095368, Q9UER6, Q9NZB, 075475, all corresponding to. the two 
alternative products of one gene: L£D($F/DFS70/p75 (lens epithelium-derived growth 
factor, referred hereafter as LEDGF) and the pS2 protein ((Ge et al, 1998; Singh et ai, 
2000)). Although the actual MW of LEDGF is approximately 60 kDa, it has been 
reported to migrate as a 75 kDa band in SDS PAGE gels (Ge et aL, 1998). On-line liquid 

25 chromatography tandem mass spectrometry analysis (LC/MS/MS) of tryptic peptides 
obtained by in-gel digestion of Inip76 provided further evidence that Inip76 is indeed 
identical to LEDGF. Lens epithelium-derived growth factor (LEDGF), is a member of the 
epatomardertved growth factor family, is found at low levels in many actively dividing 
and long lived cells. Its gene yields tow proteins, LEDGF/p75 and pS2, by alternative 

30 splicing (Ge, H., Si, Y. f and Roeder, K G. (199S) EMBO I. 17, 6723-6729 and 2). 
LEDGF belongs to a family of homologous proteins including hepatonte-derived growth 
factor (HDGF) (Nakamura, H, Izumoto, Y, Kambe, H, et al (1994) Molecular dining of 
complementary DNA for a novel human hepatoraa-derived growth Factor; its homology 
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with high mobility group-l protein J Biol Cfaem 269, 25143-25149) and HDGF-related 
protein-1 and -2, (Izumoto, Y, Kuroda, T, Harada, H; KisWuaoto, T. Nafcimura, H. (1997) 
Hepatonw-derived growth factor belongs to a gene family in mice shewing significant 
homology in the amino tenninus Biochem Biophys Res Coraraun 238,26-32). 
Half of the predicted LEDGF tryptic peptides within the mass range of 1000-2500 Da 
could be identified in the sample and their MS/MS spectra readily matched LEDGF 
covering approximately 18% of its sequence (Figure 6B and Tablet). Moreover, Imp76 
strongly reacted with a commercially available monoclonal anti-LEDGF antibody (see 
below and data not shown). Most of present in nuclear extracts could be 
iraraunoprecipitated with the anti-LEDGF antibody, whereas only about 10 % of LEDGF 
could be recovered with the anti-FLAG antibody (Figure 6Q, suggesting that LEDGF is 
present in an excess over IN r in the extract Non-tagged IN could also be efficiently 
precipitated with the anti-LEDGF antibody, and a fraction of the LEDGF could be 
precipitated with polyclonal anti-lN antibody in similar conditions from nuclear salt 
15 extracts prepared from 293T-IN*ala cells. 

Example 17: Infn7H pari of the 6! A HIV IN camp le 

To determine whether the 61 A complex contains Inip76/LEDGF, we pre-incubated the 
nuclear salt extract from 293T-lNalaFLAG cells with a monoclonal anti-LEDGF 

20 antibody prior to chromatography on a Superdex column. The IN, elation profile changed 
dramatically: the peak eluted now near the void volume of the column (Figure 6T>). 
Elution of the 61 A complex was not altered by pie-incubation of the extract with an 
unrelated mouse IgGl (Figure 6D). Predictably, elution of the 41 A IN complex (the 
presumed IN tetramer) did not change after pre-incubation of the diluted nuclear extracts 

25 with the anti-LEDGF antibody. 

When INrInip76 complex was purified by unmunoprecipitation and cross-linked 
with DTSSP, the p300 d band could be readily detected in an immunoblot with anti-JCN 
antibody (Figure 6E). However, p300 d did not react with a monoclonal anti-LEDGF 
antibody; instead a western blot with the anti-LEDGF antibody revealed two bands 
migrating at higher positions in the gel (pHMW^ and pHMwV* Figure 6E) (the MWs of 
these molecules are too high to be determined from SDS PAGE). These results suggest 
thatp300 d isaproductofmcompletecross-Iinkmg 61 AlN-Imp76 complex. Being 
composed of IN alone, p300 d probably represents an octamer of IN (a dimer of 
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tetramers). Contacts between Inip76 and IN within the 61 A complex may be less prone 
to cross-linking with DTSSP than those between IN protomers. We cannot, however, 
exclude the possibility that the target epitope for the monoclonal anti-LEDGF antibody 
used is masked or destroyed within p30Q* In addition to the major p3 00^ product, a band 
5 at a position close to pHMWV* is present on the anti-IN immunoblot of Hie purified and 
cross-linked INrfnip76 complex (Figure 6E, lane 2). Thus, the pHMWi* adduct is 
probably the smallest cross-linked IN complex containing Inip7<5. 

Exam ple 18: Inip76 co-localizes wi th INwi&an nuclei of29$T-RF alaFIAG cells 

10 Li pi ^"imalian cells, LED GF is expressed as a nuclear protein; it is distributed in a diffuse 
manner throughout the nucleus during interphase and is stably associated with condensed 
chromosomes throughout metaphase and anaphase (Nishizawa et al., 2001). A similar 
subcellular distribution has been reported for HIV-l IN (Cherepaaov et *L> 2000). 
Immunofiuorfflcent detection of both IN* and Inip76/LEDGF in fixed 293T-IN B ala cells 

15 revealed strikingly similar intranuclear distribution patterns for both proteins (Figure 7A)> 
In accordance with previous reports, both proteins were bound to condensed 
chromosomes in mitotic cells (Figure 7B). The distribution of another nuclear protein, the 
catalytic subunit of DNA-dependent protein kinase (DNAPKcs) dearly differed from 
that of INf (Figure 7C). In addition, DNAPKcs was excluded from condensed 

20 chromosomes in mitotic cells (data not shown). Intriguingly, the nuclear localization of 
INr and Inip76 did not correspond to the overall DNA steuiwg pattern, arguing against the 
possibility that the apparent co-localization of the two proteins might be merely due to 
their independent association with chromosomal DNA In a control experiment* we 
visualized INf using a mixture of polyclonal and monoclonal anti-FLAG antibodies; the 

25 obtained two-color INf staining was similar to that of INf and Inip76 (data not shown). 

Example 19: Trdp76 is an activator of HIV-l IN in vitro 

We have previously described activities of recombinant HIV- 1 IN on the mini-HIV 
substrate, a linear 4.7 kb double stranded DNA molecule, carrying the U3 and U5 
30 terminal fragments of the viral LTR sequences (Cherepanov et al., 1999). Recombinant 
HIV-l IN on itself was proficient in carrying-out cleavage and strand transfer using this 
long DNA substrate; however^ presence of 5-12% polyethylene glycol (PEG) in the 
reaction was required for the enzymatic activity. To ascribe a possible function to the 
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observed Inip76-IN interaction, we examined whether Ihip76 could modulate enzymatic 
activity ofHIV-1 IN in vitro. The mini-HIV substrate was incubated with recombinant IN 
and Hiss-tagged Inip76 in the absence of PEG, and the reaction products were analyzed 
by native agarose gel electrophoresis (Figure 6F). In the absence of Inip76, strand transfer 
5 products were almost undetectable (lane 3). Addition of Mp76 resulted in a potent 
stimulation of the reaction, m some conditions, approximately half of the substrate DNA 
was converted into various reaction products including those that were too large to enter 
the gel (lanes 7 and 8). Remarkably, both the overall efficiency of the reaction and me 
range of the strand transfer products depended on the concentration of Intp76. No 
10 significant variation in the yield of the strand transfer products was detected when the 
order of addition of lhip76 and IN to the mini-HIV reaction was reversed (data not 
shown). Although addition of Inip76 to IN reactions containing short oligonucleotide 
substrate stimulated the strand transfer activity, presence of PEG and DMSO was still 
essential for efficient Mg^-dependent activity of our recombinant enzyme preparation 
15 (Cherepanov et aL, 1997)). 



gx<mpte 2Q: Disruption of ike nuclear and ch r omosomal localization nf HW TN i,smv 
small interfering RtfA molecules spe cific to the Zni P ?6 ™kna 

Immunofluorescent detection of Inip76 and IN revealed that Inip76 expression 
was efficiently knocked-down in the majority of cells transfected with 76A, 76B and 76C 
molecules. Moreover, IN re-distributed from the nuclei to the cytoplasm of the transfected 
cells as illustrated in the FIG. SB. Importantly, after knock-down of Ihip76 expression, IN 
was not able to bind to chromosomal DNA, which was evident in all mitotic cells with 
condensed chromosomes (FIG. 8Q. The distribution and expression of IN and Mp76 
remained unchanged in mock-transfected cells or cells transfected with NC KNA (FIG. 
8A). 



Qxampte 21: Rycombmant Ir m7$-IN Co m p lex PrnAmed in Escherichia caU: Activity qf 
the Purified Recombinan t 1N-Tni p76 Co^pl^ 

The purified complex was supplemented with the IN mini-HIV substrate in the 
conditions similar to the mini-HIV reactions described above. The final reactions 



22 



|0050608..22.-0ct^02 fp;39~| 



contained 1.5, 0.75, 037, 0.18 and 0 fig of the IN-Inip76 complex, 150 ng of the mini- 
HTV substrate in the final volume of 20 jj! Hie reactions also contained llOmMNaCi, 
5 raM MgCb, 5 mM DTT, 20 mM Hepes, pH7.5. After 90 min incubation at 37 °C the 
reactions mixtures were supplemented with 0.2% SDS, 2mM EDTA plus 5 jig of 
proteinase K and farther incubated at 37 °C for 20 min, to disrupt all protein-DNA 
complexes. The reaction products separate! in 0.8% agarose gel were detected by 
staining with the SybrGold stain (Molecular Probes) (FIG. 10EB). The activity of the 
complex was similar to the activity of IN in the presence of separately added Inip76 as 
shown above. 
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Discussion 

The minimal nuclear HIV-1 IN complex is a homotefcramer 

A serious obstacle in working with recombinant retroviral integrases is their poor 
5 solubility and propensity for aggregation. When produced in bacteria, HIV IN has to be 
extracted from inclusion bodies; detergents are used to stabilize the enzyme in solution. 
All crystal structure and some in vitro muhimerization studies have been done with the 
soluble mutants. Furthermore, recent reports raised concern that the stoichiometry and 
enzymatic activities of recombinant IN can be affected by the enzyme preparation (Leh et 
30 aL, 2000; Sinha et al., 2002). Our goal was to study protein complexes that HIV-1 IN 
forms within nuclei of human cells. The bulk of HEV-1 IN present in 293T cells that 
stably produce this viral protein, is associated with the insoluble nuclear fraction. 
Although IN seems to be directly or indirectly bound to chromosomal DNA, this may not 
be the only fector in nuclear retention of IN, since digestion of the detergent- 
15 permeabilized nuclei with nucleases was not sufficient to elute IN. In this work, we 
concentrated on the study of IN complexes extracted from the detergent-penneabiUzed 
nuclei in hypertonic conditions. We found that saJtt-etoted IN exists as part of a distinct 
61 A complex, which is not stable in diluted nuclear extracts and dissociates, releasing a 
41 A core molecule. Our cross-linking and gel filtration data suggest that the latter 
20 molecule is a tetraraer of IN. Indeed, the complex exists in jtoomunoprecipitated IN? 
preparations. Its molecular weight determined both by gel filtration and cross-linking 
(115-150 kDa) is close to the value expected for the presumed tetramer (128 kDa). 
Finally, partial cress-linking of the diluted nuclear extracts and of the purified tagged IN 
protein produced bands at the positions corresponding to the IN monomer, dimer and 
25 triraer, in addition to pi SO" 1 (data not shown and Figure 2A, lane 7). We estimate that the 
concentrations of IN and ESTr in the nuclear extracts did not exceed 10 nM (< 03 ug/ml IN 
at 100 ug/ml total protein). Therefore, the IN tetramer was stable even at subnanomolar 
concentrations (Le. in the extracts diluted to 4 ug/ml of total protein), implying that the 
minimal nuclear IN complex is a tetramer 



30 



Composition of the 61 A HIV-1 IN complex 

The native 61 A IN complex was found to decay during prolonged incubations and not to 
be stable during purification. When the immunoprecipitation time was shortened and 
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higher initial extract concentrations were used, a cellular protein with an apparent MW of 
76 kDa specifically copreciprtated with the ephope-tagged IN, The protein, or Inip76, as 
we refer to it, was found to be identical to human LEDGF/DFS70/p75 proton. We further 
confirmed that Inip76/LEDGF is apart of the native 61 A IN complex, since addition of a 

5 monoclonal anti-LEDGF antibody shifted ehition of the complex from a gel filtration 
column. Importantly, when LEDGF protein was present, the $300* band became visible 
after DTSSP cross-linkmg of the immunoprecdpitated IN*. This band could not be 
reproduced in LEDGF-negative INjr samples obtained following the long 
immunoprecipitation protocol. ttOtf* seems to be the result of partial cross-linking of the 

10 61 A IN complex for the following three reasons: (*) the MW was lower than the expected 
value for a 61 A globular complex (*4G0 kDa); (if) DTSSP cross-linking products with 
apparent MWs higher than that of p3Q0* were visible on anti-IN immunoblots; (to) this 
product did not react with anti-LEDGF antibody. Based on its apparent molecular weight, 
we speculate that plOtf* probably represents two cross-linked IN tetramers. 

15 What is the exact stoichiometry of the IN-Inip76 complex? At the concentrations 

it is present in nuclear extracts, the 61 A complex -was not stable enough to allow 
measurement of its sedimentation coefficient which is required to determine its precise 
MW (Siegel and Monty, 1965), Assuming that the 61 A complex is globular, we estimate 
its MW to be around 400 kDa. The simplest model compatible with this MW is of a 

20 symmetrical complex containing a pair of IN tetramers and two subunits of Inip76, which 
corresponds to a macromolecule of 370 kDa. Reconstitution of the 3N-Inip76 complex 
from the recombinant proteins will help to confirm the proposed stoichiometry. At this 
time, we cannot rule out the possibility that the native 61 A complex contains another 
cellular protein lost during immunoprecipitation. The purified Loip76-co0taimng IN 

25 samples displayed complex gel filtration profiles, probably due to partial dissociation of 
the native complex. 

Retroviral IN within PIC: a dimer of dimers or a dimer of tetramers? 
During reverse transcription, the two retroviral cDNA termini are not completed 
30 simultaneously and both seem to be substrates for the 3 '-processing activity of IN as soon 
as they appear (Miller et al, 1997). Moreover, at least in case of HIV, 3*-processing of 
one LTR end was observed when the second end was non-functional and also did not 
support normal intasome assembly (Chen and Engelman, 2001). On the other hand, two 
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functional LTRs were found to be required for strand transfer activity of isolated HTV 
PICs. Therefore, although LTRs can be processed asymmetrically, a synaptic complex 
involving both LTRs must form to allow strand transfer, ensuring that only legitimate 
integration of both retroviral cDNA ends occurs. Based on a comparison to Ma phage 
5 transposase and available crystal structure data, it has been suggested that the active form 
of retroviral 3N is a tetramer (a dimer of dimers) (Wang et al., 2001). Our results suggest 
that HIV IN expressed in human cells is indeed present as a stable tetramer. Recombinant 
HIV- 1 IN as well has been recently reported to form stable tetramers in diluted solutions 
(Leh et aL, 2000). Intriguingly, both Mi phage and Tn5 transposases form active 
10 multimers (tetramers and dimer respectively) only within their synaptic completes. Thus, 
independent toansposase subunhs must first bind to the ends of the Ixansposon genome, 
before being brought together to form me synaptic complex. Extrapolating this scheme to 
the retroviral PIC assembly, the stable IN tetramers can be looked at as such independent 
subunits, which first have to bind to one LTR end each before interacting with each other. 
According to this model, each individual tetramer would be capable of currying out 3'- 
processing reaction, whereas a dimer of tetramers would be necessary to accomplish the 
strand transfer. The data presented here suggest that the cellular HTV-1 IN 61 A complex 
contains two IN tetramers that are close enough to be cross-linked to each either. It is 
therefore very tempting to speculate mat this dimer of tetramers may represent the 
configuration of IN required for concerted integration. 

What is the role of Inip76 in retroviral replication? 

The human Ini l/hSNF5/B AF47 protein has been shown to interact with HTV-1 IN in a 
yeast two hybrid screen and in vitro. Neither did we observe co-immunoprecipftation of 
Inil with IN from nuclear extracts; nor were we able to detect co-localization of the two 
proteins in foe cell (to be published elsewhere). We speculate that the cellular IN-foil 
interaction might be transient and only occurring during certain steps of viral replication. 
We have now shown that IN complexes extracted from the cell lines stably expressing 
HIV-1 IN contained another human protein, foip76, identical to LEDGF/DFS70/p75. IN f 
and Ihip76 co-localized in the nuclei of 293T-IN*alaFLAG cells, confirming that the 
observed interaction was not an artifact of the extraction. So for, there has been no 
previous account of a link between L£DGF/DFS70/p75 and retroviral replication. No 
obvious relation seems to exist between this protein and known chromatin remodeling 
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factors. The protein has first been described as the positive transcription cofactor PC4- 
int erecting protein (Ge et al., 1998). Two alternatively spliced cDNA clones were isolated 
from a HeLa library coding for two proteins p52 (333 amino acids, as predicted from the 
cDNA) and p75 (S30 aminoacids) sharing identical 325 amino terminal residues. The 
transcripts coding for p75 and p52 were detected in different cell types and tissues, p52 
being most abundant in testis and p75 in thymus. Independently, a cDNA clone coding 
for a proton identical to p75 has been isolated from a lens epithelium cell library (Singh 
et al., 1999). Overexpression of the protein stimulated survival of diverse primary cells 
and cell lines and enhanced their resistance to oxidative and hyperthermic stress (hence 
LEDGF, "lens epithelium derived growth factor"). The same protein has also been 
identified as the DFS70 autoantigen, antibodies to which were found in some cases of 
atopic dermatitis, asthma and interstitial cystitis (Ochs et aL, 2000). One case of acute 
myeloid leukemia with a chromosomal translocation resulting in a fusion of the NUP9S 
and LEDGF genes has been described (Ahuja et al., 2000). 

Based on sequence similarity, Inip76/LEDGF/DFS70/p75 is a member, of the 
hepatoma-derived growth motor (HDGF) family that includes HDGF and several other 
HDGF-related proteins (HRPs) (reviewed in (Diets et al., 2002)). High degree of 
homology exists between the amino terminal regions of these proteins. The FWWP motif 
(70 residues containing the Pro-Trp-Trp-Pro core sequence) is located within the amino 
terminal homology region of HRPs and relates them to a larger and functionally diverse 
nuclear protein family, that includes DNA-binding transcription factors and enzymes 
involved in DNA repair and DNA raethylation. PWWP domains are thought to be 
implicated in protein-protein interactions (Stec et aL, 2000). 

An eye-catching feature of Inip76 is the abundance of the charged amino acids 
comprising approximately 40% of the total residues. Screening its sequence against the 
BLOKS+ protein motif database (Henikoff et al., 1999) revealed fragments with 
similarity to the HMG-I(Y) DNA AT hook sequence (data not shown). LEDGF has been 
shown to be a DNA-binding protein with affinity for heat shock and stress-related DNA 
elements (HSE and STKEs) (Singh et aL, 2001). It has also been reported to interact with 
componestrts of the general transcription machinery and with the transcription activation 
domain of VP 16 (Ge et al., 1998). Recent reports suggested that LEDGF plays an 
important role in regulating expression of the stress-response genes (Fatraa et al., 2001; 
Shinohara et aL, 2002). As mentioned above, alternative splicing of LEDGF pre-mRNA, 
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allows expression of the second protein, pS2, from the same gene. A growing body of 
evidence suggests that p75 and p52 may have different Amotions. Although they both can 
interact with PC4, VP 16 and general transcription factors, at least in vitro, p52 displays 
higher transcription activation activity (Ge et aJ., 1998). In addition, p52 and not p75 has 
5 been shown to fictionally interact with the ASF/SF2 splicing factor in vitro (Ge et aL, 
1998). The proteins also differ in their nuclear distribution patterns (NisMzawa et aL, 
2001). Intracellular levels of p52 appear to be much lower than those of p75, at least in 
the cell lines we have tested (HEK-293, 293T, HeLa, CEM) (data not shown). We have 
not detected co-immunoprecipitation of p52 with IN from the nuclear cell lysates of IN- 
10 expressing 293T cells. However, it remains to be determined whether pS2 is able to 
interact with HTV IN. 

What is the role of the Inip76-IN interaction in the retroviral life cycle? 

The feet that recombinant Inip76 was able to stimulate HTV-1 IN activity in vitro suggests 

15 a direct involvement of Inip76 in the integration process. As a chromosome-associated 
IN-bmding protein, Inip76 may serve as a docking factor for the PIC. It might thus be 
functionally similar to the Mu phage transposition co-factor MuB that associating with 
the acceptor DNA makes it a preferred target for transposition. It is also conceivable that 
Inip76 might be a part of the retroviral PIC. The experiments are currently underway to 

20 establish whether Inip76 is an essential factor in HTV* replication. If proved to be 
essential, Inip76 will constitute a novel target for anti-retroviral therapy directed against 
its interaction with IN. Alternatively, a therapeutic strategy based on modified Inip76 
protein, designed to capture the viral IN in a catalytically quiescent complex, may be 
envisaged. 
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Legends to the Figures 

Figure 1. Extraction of IN from nuclei of 293T-IN*ala cells. 

(A) 293T-IN s ala cells were tysed in 100CSK buffer in the presence of 4 ug/ml digitonin 
5 or 0.5% NP40 on ice for 10 minutes. The supernatant (S) and nuclear pellet (P) fractions 

were recovered and analyzed by western blotting with an anti-IN antibody. The first lane 
contains the total cell extract 

(B) 293T-IN*ala cells were lysed in 100CSK buffer supplemented with 0.5% NP40 on ice 
for 10 minutes, and the extracted nuclei were re-suspended in CSK buffer co ntaining 0 - 

10 500 mM NaCl. After centri&gation, supematants (S) and nuclear pellets <P) were 
analyzed by western blotting with an anti-IN antibody. The total nuclear protein was 
loaded in the first lane (nuc). 

(C) NP40-permeabilized nuclei from the 293T-IN I ala cells, prepared as above, were 
incubated in 100CSK buffer with (+DNase I) or without (-DNase I) DNAsel 

15 (250 units/ml) at 25 °C for 10 (10') or 30 (30*) minutes. Following cerraifugation, the 
supematants (S) and pellets (P) were separated in an 11% SDS PAGE gel, the upper part 
of which was used for the immunoblot to detect MCM3 (91 kDa) and the lower part to 
detect IN (32 kDa). The total cytoplasmic and nuclear protein fractions were loaded in the 
first (cyt) and the second (nuc) lanes, respectively. 

20 (D) NP40-permeabilized nuclei from 293 T-H-Pala cells were incubated in 100CSK buffer 
with or without DNase I (250 units/ml) for 10 minutes, pelleted by centrifogation and 
resuspended in ice-cold hypotonic buffer (2 mM EDTA, 2 mM Hepes [pH7.S]). After 
centruugatjan, supematants (S) and pellets (P) were analyzed by western blotting with 
anti-IN antibodies. The first lane (nuc) contains total nuclear protein. 

25 

Figure 2. Cross-linking of the IN and EN f complexes with DTSSP. 
(A) The nuclear extract from 293T-IN*ala cells was prepared in the 400CSK buffer, was 
cross-linked with DTSSP and separated in a non-reducing 4-12% SDS PAGE gel. The 
IN-containing cross-linking adducts were detected by western blotting with a polyclonal 
30 anti-IN antibody. Prior to cross-linking, the extract was adjusted to 100 ug/ml (lanes 1-5X 
20 ug/ml (lanes 6-9) or 4 ug/ml (lanes 10-13) of total protein. The concentration of 
DTSSP was 0.1 mM (lanes 3, 7, 11), 0.S mM (lanes 1, 4, 8, 12) or 2.0 mM (lanes 5, 9, 
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13). No cross-linker was added to the samples in lanes 2, 6 and 10. The sample in lane 1 
was cross-linked in the presence of 0.2% SDS. 

(B) Cross-linking of the FLAGtagged IN complexes was done in similar conditions as 
in (A). Only lanes containing samples cross-linked with 2 mM DTSSP are shown; the 
sample in lane 1 was cross-linked in the presence of 0.2% SDS. The p300 d , pX50 d , p60 d , 
the IN monomer bands and the positions of the MW markers are indicated. The «470 kDa 
mark corresponds to the band of the catalytic subnnit of DNAPK (MW 469 kDa), which 
was detected in a separate lane with a monoclonal anti-DNA PKcs antibody. 



Figure 3. Determination of sizes and molecular weights of the IN complexes. 

(A) Chromatography of nuclear extracts from 293T-J0N*aIa cells was carried out on a 
calibrated Supeidex 200 column. Prior to chromatography, the extract was adjusted to 
550 ug/ml or 30 ug/ml of total protein; the collected fractions (1-26) were tested for the 
presence of IN by western blotting. The elution volumes (V«) and the respective partition 
15 coefficients <Ka») for the observed IN peaks are indicated. 

(B, C) Determination of the Stokes radii and approximate molecular weights of the IN 
complexes from the experimental K„ values. The partition coefficients for the standard 
proteins were determined in the same conditions (thyr., thyroglobulin, K«v - 0.039; 
ferritin, = 0.17; catalase, K*y = 0.26; aldolase, K*, - 0.28; BSA, bovine serum 
20 albumin, = 0.39; chymotr., chymotrypsinogen A,K* V = 0.59). 

Figore 4. Imraunoprecipitation of the FLAG-tagged IN from diluted nuclear 
extracts. 

(A) Nuclear extracts prepared from metabolically labeled 293T-IN*ala and 293T- 
25 IN'alaFLAG cells were diluted to 200 ug/ml of total protein and immunopredpitated with 

the anti-W.AG M2 antibody and protein G agarose beads for 16 hours at 4 °C. The protein 
was eluted by boiling in SDS PAGE sample buffer and separated in a 4-20% SDS PAGE 
geL Radioautograph of the gel is presented. 

(B) FLAG-tagged IN was immunopredpitated from a diluted nuclear extract of non- 
30 labeled 293T-INalaFLAG cells overnight. The protein was eluted with FLAG peptide in 

400CSK buffer and cross-linked with DTSSP. The reaction conditions are similar to those 
in Figure 2. 
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(O The protein immimoprecipitated and ehrted as in (B) was subjected to gel filtration on 
a Superdex 200 column. The fractions collected (1-26) -were analyzed by western blotting 
with anti-IN antibodies. The R* value corresponding to the observed peak was determined 
as in Figure 3B. 



Figure 5. Co- immunoprecipitation of FLAG-tagged IN with Inip76 from non^diluted 
nuclear extracts. 

(A) Nuclear extracts from 293T-ItfalaFLAG and 293T celts (700 jig/ml of total protein) 
were incubated with anti-FLAG M2 antibody and protein G agarose beads for 4 hours. 

10 The beads were washed as described in materials and methods and bound proteins were 
ehrted with FLAG peptide in 400CSK buffer. The eluted proteins were concentrated by 
precipitation with TCA, re-dissolved in SDS sample buffer, separated in 4-20% 
denaturing PAGE gels and visualized by silver staining. Lane 1: immunoprecipitstc of a 
nuclear extract from 60x10* 293T-IN*alaFLAG cells. Lanes 2 and 3: 

15 immunoprecipitation was done in parallel with nuclear extracts from 293T and 293 T- 
DSPalaFLAG cells. The bands of IN& Inip76, the heavy and the light chains of the anti- 
FLAG M2 IgGl antibody as well as aprotinin (protease inhibitor present in 400CSK 
buffer) are indicated. Two sets of molecular weight markers were used in both gels to 
determine the apparent molecular weight of Inip76. Positions of the molecular weight 

20 markers are shown. 

(B) Nuclear extract from 20 xlO* 293T-IN a alaFLAG cells («700 ^ig/ml total protein) was 
immunoprecipitated for either 4.5 hours (left lane) or 18 hours (right lane). The protein 
was then eluted and analyzed as in (A). 

25 Figure £» Inip76 is identical to LEDGF/DFS70/p7S, appears to be a component of the 
61 A complex and stimulates IN activity in vitro, 

(A) The Coomassie blue-stained PVDF membrane used for ammo-terminal 
microsequencing of Inip76. The INrInip76 complex was eluted from protein G agarose 
immobilized anti-FLAG M2 antibody with FLAG peptide, separated in reducing 4-20% 
30 SDS PAGE gel and transferred onto the PVDF membrane (lane 1). The proteins left on 
the beads after incubation with FLAG peptide were eluted with SDS sample buffer Qvsie 
2). The bands corresponding to Inip76, ESfe the heavy (H) and light (L) chains of the anti- 
FLAG M2 antibody, aprotinin and the MW markers are indicated. 
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(B> Deconvoluted and sequence-converted MS/MS spectrum of a doubly charged peptide 
ion (mfe 982.57) obtained from the jn-gel tryptic digest of Mp76 coxxesponding to the 
LEDGF peptide N425-K442. The observed b- and y-dominant fragment ions are 
indicated (Biemann, 1990). 

(Q Co-immunoprecipitation of LEDGF and FLAG-tagged IN from a nuclear extract of 
293T-IN'alaFLAG cells. Immunoprecipitation was carried with anti-FLAG (lanes 2 and 
2'), anti-LEDGF (lanes 3 and 3') or no antibody (lanes 1 and 1'). After 4 hours 
incubation, protein G agarose beads with precipitated protein complexes were washed 
with three changes of 400CSK buffer and re-suspended in reducing SDS PAGE sample 
buffer, followed by western blotting to detect INf and Inip7cVLEDGF (lanes l*-3*). The 
lanes 1-3 contain the immunoprecipitation supernatants. 

(D) Ehition of the 61 A IN complex from a gel filtration column is shifted after pre- 
incubation with anti-LEDGF antibody. A nuclear extract of 293T-IN s alaFLAG cells was 
preincubated with 3 jig/ml anti-HA (control IgGl) or anti-LEDGF antibody and separated 

15 by chromatography on a Superdex 200 column. INf was detected in the fractions by 
western blotting. The void volume of the column was 8.3 ml, approximately 
corresponding to fraction 2. 

(E) Cross-linking of the Ihip76-INr complex with DTSSP. The Inip7o7INf complex was 
iramunoprecipitated from a nuclear extract prepared from 293T-IN 1 alaELAG cells with 

20 anti-FLAG M2 antibody and protein G agarose for 4.5 hours. The protein was ehited with 
FLAG peptide and incubated with 2 mM DTSSP in the presence (lanes 1 and 1') or 
absence (lanes 2 and 2') of 0.2% SDS. The cross-linked samples were then separated in a 
non-reducing 4-12% SDS PAGE gel and immunoblotted with polyclonal anti-IN (left • 
blot) or monoclonal anti-LEDGF (right blot) antibodies. The positions of IN* and Imp76 

25 as well as the cross-linking adducts plSC*, p300* pHMW^ and pHMW* d are indicated. 
Anti-FLAG M2 IgGl present in the sample is detected on the anti-LEDGF western blot. 

(F) Recombinant Ioip76 enhances HTV-1 IN strand transfer activity in vitro. Mni-HIV 
DNA was pre-incubated with HTV-1 IN for 7 minutes at room temperature. Next, 
0-0.8 mM His6-tagged Imp76 was added to the reactions that were further incubated at 

30 37°C for 90 minutes. The concentrations of EST and Inip76 used in the reactions are 
indicated. The reactions contained 125 ng mini-HIV DNA, 1 10 mM NaCI, 20 mM Hepes 
[pH 7.5], 5 mM DTT and 5 mM ZaCk in a final volume of 20 ul The reactions were 
stopped by addition of 0.5% SDS and 25 mMEDTA and the samples were digested with 
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0.25 mg/ml proteinase K at 37 °C for 30 minutes to completely disrapt proteinhDNA 
complexes. DNA was then precipitated with ethanol, re-dissolved in Tris-EDTA and 
analyzed by electrophoresis in an 0.8% agarose geL The positions of the DNA molecular 
weight markers (23.1, 9.4, 6,6, 4.4 and 2.3 Mb) are indicated The gel was stained with 
5 SybrGoId (Molecular Probes). 

Figure 7. Co-localization of FLAG- tagged IN and loip76/LEDGF in 293T- 
IN*alaFLAG cells. 

(A) Confocal laser scanning micrographs of a fixed and permeabilized cell fluorescently 
10 stained with a combination of monoclonal antMUEDGF phis Alexa-488 conjugated anti- 
mouse antibodies to detect Inip76 (green, Inip76) and rabbit polyclonal anti-FLAG plus 
Alexa-555 conjured anti-rabbit antibodies (red, IN) to localize FLAG-tagged IN. DNA 
was stained with To-Pn>3 iodide (shown as white). The two-color merged image 
(IN+Inip76) was produced by overlaying the IN and Ihip76 images. 
IS (B) Both INf and Inip76 are associated with condensed chromosomes during mitosis. 
Immunofluorescent staining was performed as described in (A). 

(C) DNA PKcs and INf display no significant co-localization- INf (red, IN) was detected 
as in (A); DNA PKcs (red, PKcs) was localized with monoclonal anti-DNA PKcs 
antibody plus Alexa-555 conjugated anti-mouse antibody. The two color IN+PKcs image 
20 is an overlay of the IN and PKcs images. 

FIG. 8. Effect of RNA interference-mediated knock-down of Inip76/LEDGF expression 
in 293T-IN*alaFLAG cells on HTV IN distribution. 

Ihip76 (A, B) and IN (Q were detected by indirect immunofluorescence in non- 
25 transfected cells (A), cells transfected with 76A sflRNA (B, Q. Comiunofluorescence, 
chromosomal DNA staining and detection was done as described in Example VI. 



FIG. 9. Purification of the Ihip76-IN complex from bacteria co-expressing the two 
30 proteins. 

(A) Elution of the IN-Inip76 complex from NiNTA Sepharose; 7 fractions collected were 
analyzed by 11% SDS PAGE and Coomassie staining. (B) A lysate of induced 
PC2LEDGF cells lacking Hise-tagged IN was incubated with NiNTA Sepharose and the 
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bound proteins were eJwted with 200 mM imidazoL The band corresponding to Inip76 is 
not present on this gel, confirming that free Inip76 is not associating with NiNTA 
Sepharose. (C) Further purification of the IN-Inip76 complex by cation exchange 
chromatography. The fractions 3, 4, 5 and 6 ehrted from NiNTA (see panel A). They were 
5 pooled and loaded onto a 1 ml HfTrap SP Sepharose column. The complex was eluted 
with finear gradient of NaCi concentration. The non-bound (flow-through) fraction as 
well as the fractions #10 through #28 were analyzed by 1 1 % SDS PAGE and Coomassie 
staining. Positions of the molecular weight markers (kDa) are indicated. 

10 FIG. 10. Concentration of the enzymatically-actrve IN-Inip76 complex. 

(A) The complex purified by chromatography on NiNTA and SP Sepharose was 
concentrated using Centricon-30. Equal volumes of the original non-concentrated sample 
(lane l), the intermediate samples Games 2, 3) and the final concentrated sample (lane 4) 
were separated by 11 % SDS PAGE. (B) Activity of the purified and concentrated 

15 recombinant IN-Inip76 complex was tested in the mim-HTV reaction in conditions 
described in the Example VII. Positions of the DNA molecular weight markers (kb) 
separated in a separate lane of the gel are indicated. 
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Integrase Tetramers 

Claims 

5 

l.A molecule which comprises a region specifically binding to a protein of the hepatoma- 
derived growth factor family or nucleic adds encoding said protein of the hepatoma- 
derived growth factor family, and suppresses or prevents retroviral replication. 

10 2. The molecule of claim 1, characterised in that it prevents or suppress the protein of the 
hepatoma-derived growth factor family of interacting with retroviral integrase. 

3. The molecule of claim 1 or 2, which comprises a region specifically binding to the 
protein Inip76 or nucleic acids encoding said Inip76, and suppresses or prevents viral 

15 replication. 

4. The molecule of claim 3, characterised in that it prevents or suppress protein Xnip76 of 
interacting with retroviral integrase. 

20 5. The molecule of any the claims 1 to 4, characterised in that it suppresses or prevents 
retroviral replication. 

6. The molecule of any the claims 1 to 4, characterised that it suppresses or prevents HIV 
replication. 

25 

7. A molecule according to any of the claims 1 to 6, which is chosen from the group 
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conxpristog an antibody or any fragment thereof; a small molecule specifically binding to ^ 
Liip76; or nucleic acids encoding said Inip76; a ribozyme against nucleic acids encoding 
Inip76; and anti-sense nucleic acids hybridising with nucleic acids encoding Inip76. 

5 8. Use of molecule according to any of claims 1 to 7 for the preparation of a medicament to 
treat or prevent retroviral infection, 

9. Use of a molecule according to any of claims 1 to 7 for the preparation of a medicament 
to treat or prevent HTV infection. 

10 

10. A method to identify molecules according to claims 3 or 4, comprising exposing Liip76 
or nucleic acids encoding said Inip76 to at least one molecule whose ability to suppress of 
Xoip76 of interacting with a retroviral integrase protein is sought to be determined, 
detennining binding or hybridising of said raolecule(s) to Inip76 or to binding places on 

15 said integrase of Inip76 protein or to nucleic acids encoding said Mp76 monitoring the 
prevention or suppressing of retroviral replication or integration by the usage of at least one 
of said molecules. 

12.A method to identify molecules according to claims 3 or 4, comprising exposing Inip76 
or nucleic acids encoding said Inip76 to at least one molecule whose ability to suppress of 
20 Inip76 of acting as a cellular cofector of retroviral integration is sought to be determined, 
detennining binding or hybridising of said molecule(s) to Inip76 or to binding places on 
said retroviral integrase of Itup76 or to nucleic acids encoding said Inip76 monitoring the 
prevention or suppressing of retroviral integration by the usage of at least one of said 
molecules. 

25 
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13. A method to identify molecules according to claims 3 or 4, comprising exposing Inip76 
or nucleic acids encoding said lnip76 to at least one molecule whose ability to suppress of 
Inip76 of promoting strand transfer activity of HIV is sought to be determined, dete rmining 
binding or hybridising of said molecule(s) to Inip76 or to binding places on said integrase 

5 protein of Inip76 or to nucleic adds encoding said Imp76 monitoring the prevention or 
suppressing of HIV integration by the usage of at least one of said molecules, 

14. A method for the production of a pharmaceutical composition comprising the usage of 
the method according to claims 9-12 and further more mixing said molecule identified, or a 

10 derivative or homologue thereof, with a pharmaceutical^ acceptable carrier. 

15. A protein complex comprising a retroviral integrate and Imp76. 

16. A protein complex comprising a recombinant retroviral integrase and a recombinant 
IS Inip76. 

17 k A method of stimulating retroviral integration using Inip76 or functional analogues 
thereof 

20 18. The use of 3nip76 as a cellular cofactor of retroviral integration. 
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Integrase tetramers 



Abstract 



5 In a study of HIV- J integrase (IN) complexes derived from nuclei of human cells stably 
expressing the viral protein from a synthetic gene it was demonstrated that in the nuclear 
extracts IN exists as part of a large distract complex with apparent Stokes radius of 61 A, 
which dissociates upon dilution yielding a core molecule of 41 A. The IN complexes were 
isolated from cells expressing FLAG-tagged IN. By present invention it was demonstrated 

10 that the 41 A core is a tetiamer of IN, whereas 61 A molecules are composed of IN 
tetramerB associated with a cellular protein with an apparent molecular weight of 76 kDa. 
This integrase interacting protein (Inip76) was found to be identical to LEDGF/DFS70/p75, 
a protein implicated in regulation of gene expression and cellular stress-response. HTV-1 IN 
and Inip76 co-localized in the nuclei of human cells stably expressing IN. Furthermore, it 

15 has been demonstrated by present invention that recombinant Inip76 strongly promoted 
strand-transfer activity of HTV-1 IN in vitro. Our findings reveal that the minimal IN 
molecule in human cells is a tetramer and clearly demonstrates that Inip76 is likely to play 
a role in retroviral integration. 
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